A machine learning approach to detecting fraudulent job types

نویسندگان

چکیده

Abstract Job seekers find themselves increasingly duped and misled by fraudulent job advertisements, posing a threat to their privacy, security well-being. There is clear need for solutions that can protect innocent seekers. Existing approaches detecting jobs do not scale well, function like black-box, lack interpretability, which essential guide applicants’ decision-making. Moreover, commonly used lexical features may be insufficient as the representation does capture contextual semantics of underlying document. Hence, this paper explores what extent different categorizations classified. In addition, seeks type are most relevant in classifying job. paper, we develop validate machine learning system identifying identity theft, corporate theft multi-level marketing amongst advertisements. We utilized four classes features: empirical rule set-based features, bag-of-word models, recent state-of-the-art word embeddings transformer models various classifiers. The were validated evaluating them on publicly available description dataset. Our results indicate transformer-based consistently outperformed handcrafted rule-set based class. Ultimately, Gradient Boosting classifier with combination parts-of-speech tags bag-of-words vectors achieved best performance an F1-score 0.88.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Detecting Encrypted Traffic: A Machine Learning Approach

Detecting encrypted traffic is increasingly important for deep packet inspection (DPI) to improve the performance of intrusion detection systems. We propose a machine learning approach with several randomness tests to achieve high accuracy detection of encrypted traffic while requiring low overhead incurred by the detection procedure. To demonstrate how effective the proposed approach is, the p...

متن کامل

Detecting Spam Blogs: A Machine Learning Approach

Weblogs or blogs are an important new way to publish information, engage in discussions, and form communities on the Internet. The Blogosphere has unfortunately been infected by several varieties of spam-like content. Blog search engines, for example, are inundated by posts from splogs – false blogs with machine generated or hijacked content whose sole purpose is to host ads or raise the PageRa...

متن کامل

A machine-learning approach to detecting unknown bacterial serovars

Technologies for rapid detection of bacterial pathogens are crucial for securing the food supply. A light-scattering sensor recently developed for real-time identification of multiple colonies has shown great promise for distinguishing bacteria cultures. The classification approach currently used with this system relies on supervised learning. For accurate classification of bacterial pathogens,...

متن کامل

Detecting Targets in SAR Images: A Machine Learning Approach

This paper describes a novel application of the MIST methodology to target detection in SAR images. Specifically, a polarimetric whitening filter and a constant false alarm rate detector are used to preprocess a SAR image; then the AQ15c learning program is applied to learn and detect targets. Encouraging and impressive experimental results are provided. KEYWORD: Learning in vision, target dete...

متن کامل

A Machine Learning Approach to No-Reference Objective Video Quality Assessment for High Definition Resources

The video quality assessment must be adapted to the human visual system, which is why researchers have performed subjective viewing experiments in order to obtain the conditions of encoding of video systems to provide the best quality to the user. The objective of this study is to assess the video quality using image features extraction without using reference video. RMSE values and processing ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: AI & society

سال: 2022

ISSN: ['0951-5666', '1435-5655']

DOI: https://doi.org/10.1007/s00146-022-01469-0